Nonparametric imputation method for nonresponse in surveys
نویسندگان
چکیده
Many imputation methods are based on statistical models that assume that the variable of interest is a noisy observation of a function of the auxiliary variables or covariates. Misspecification of this model may lead to severe errors in estimates and to misleading conclusions. A new imputation method for item nonresponse in surveys is proposed based on a nonparametric estimation of the functional dependence between the variable of interest and the auxiliary variables. ∗Affiliation while the research was conducted: Institute of Statistics, University of Neuchâtel, Av. de Bellevaux 51, 2000 Neuchâtel, Switzerland. Current affiliation: Department of Computer and Mathematical Sciences, University of Toronto Scarborough, 1265 Military Trail, Toronto, Ontario, M1C 1A4, Canada. †Department of Statistical Sciences, University of Toronto, 100 St. Georges Street, Toronto, Ontario, M5S 3G3, Canada 1 ar X iv :1 60 3. 05 06 8v 2 [ st at .M E ] 6 F eb 2 01 7 We consider the use of smoothing spline estimation within an additive model framework to flexibly build an imputation model in the case of multiple auxiliary variables. The performance of our method is assessed via numerical experiments involving simulated and real data.
منابع مشابه
A Pseudo Empirical Likelihood Approach for Stratified Samples with Nonresponse
Nonresponse is common in surveys. When the response probability of a survey variable Y depends on Y through an observed auxiliary categorical variable Z (i.e., the response probability of Y is conditionally independent of Y given Z), a simple method often used in practice is to use Z categories as imputation cells and construct estimators by imputing nonrespondents or reweighting respondents wi...
متن کاملEstimating Variance of the Sample Mean in Two-phase Sampling with Unit Non-response Effect
In sample surveys, we always deal with two types of errors: Sampling error and non-sampling error. One of the most common non-sampling errors is nonresponse. This error happens when some sample units are not observed or viewed but they do not answer some of the questions. The complete prevention of this error is not possible, but it can be significantly reduced. The non-response causes bias and ...
متن کاملCreating Imputation Classes Using Classification Tree Methodology
Virtually all surveys encounter some level of item nonresponse. To address this potential source of bias, practitioners often use imputation to replace missing values with valid values through some form of stochastic modeling. In order to improve the reliabilities of such models, imputation classes are formed to produce homogenous groups of respondents, where homogeneity is measured with respec...
متن کاملNonparametric Markov chain bootstrap for multiple imputation
Multiple imputation is a statistical method for analyzing data with missing values. Nonparametric Markov chain bootstrap methods can be used to generate multiple imputations of both scalar and multivariate outcome variables, under the assumption that the data are missing completely at random, and nonparametric inference can be obtained using multiple implementation bootstrap. The nonparametric ...
متن کاملNonparametric Bayesian Multiple Imputation for Incomplete Categorical Variables in Large-Scale Assessment Surveys
In many surveys, the data comprise a large number of categorical variables that suffer from item nonresponse. Standard methods for multiple imputation, like log-linear models or sequential regression imputation, can fail to capture complex dependencies and can be difficult to implement effectively in high dimensions. We present a fully Bayesian, joint modeling approach to multiple imputation fo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016